Don't stop pools #1199

shsms · 2025-04-15T16:28:11Z

When there are multiple pool instances controlling the same components and one of them stops, that would stop all the pools.

This PR adds an emergency fix to prevent this bug, to buy some time to come up with a better solution.

Copilot

Copilot reviewed 4 out of 4 changed files in this pull request and generated no comments.

Comments suppressed due to low confidence (3)

src/frequenz/sdk/timeseries/pv_pool/_pv_pool.py:181

Removing this cleanup call might leave pending tasks or open channels. Consider verifying that _pool_ref_store's resources are managed appropriately elsewhere, and document the reasoning for this removal.

await self._pool_ref_store.stop()

src/frequenz/sdk/timeseries/ev_charger_pool/_ev_charger_pool.py:219

Ensure that omitting the stop operation on _pool_ref_store doesn't lead to resource leaks. If resource cleanup is handled in another part of the code, an inline comment explaining this would improve clarity.

await self._pool_ref_store.stop()

src/frequenz/sdk/timeseries/battery_pool/_battery_pool.py:401

Double-check that the removal of the stop call does not leave orphaned tasks or channels. Consider adding tests or documentation to confirm that resource management remains intact.

await self._pool_ref_store.stop()

ela-kotulska-frequenz

Wow but the formulas are shared, too.
So we have the same problem if 2 actors uses the same formula (like BatteryPool.power()) and one of them stops it.
And the same for microgrid.producer().power - one actor stops it, it won't work for other actors.

So this PR won't fix the problem :/
Maybe quick fix would be to not share pools & formulas? (I don't know how much work it would be).
Or have counter (how many is active and how many stopped

ela-kotulska-frequenz · 2025-04-16T07:36:24Z

src/frequenz/sdk/timeseries/ev_charger_pool/_ev_charger_pool.py

    async def stop(self) -> None:
        """Stop all tasks and channels owned by the EVChargerPool."""
-        await self._pool_ref_store.stop()


Could you please add comment why this methods are empty?
It looks like bug now, because method description and definition does different thing.

llucax · 2025-04-16T08:05:19Z

👋 reference counting 👋

When there are multiple pool instances controlling the same components and one of them stops, that would stop all the pools. This commit adds an emergency fix to prevent this bug, to buy some time to come up with a better solution. Signed-off-by: Sahas Subramanian <[email protected]>

Signed-off-by: Sahas Subramanian <[email protected]>

llucax

Approving because I guess if you are making a PR with such a hack, you really need it urgently, but I think the release notes are a bit misleading. You fixed a bug by introducing a new bug (pools are not stopped) 😆

shsms · 2025-04-16T08:26:03Z

And the same for microgrid.producer().power - one actor stops it, it won't work for other actors.

This is not a problem. Users only want to close the resources that they have created. So when they make a new_receiver() and have a reference to the receiver, they will stop it. The same is true with new_battery_pool because they are creating it.

We don't have the idiom of holding references to built-in formulas. When people create custom formulas, they can stop those; that's fine.

Also, when formulas are closed by mistake, it's immediately noticeable. But when the battery pool is closed by mistake and we're only subscribing to SOC or capacity, for example, it's much more dangerous because we'll just assume the value hasn't changed.

Maybe quick fix would be to not share pools & formulas?

Because there are so many metrics, that would quickly become a noticeable overhead. Additionally, the pools share access to the power manager, etc.

I think we should just look for a proper solution, hopefully next week.

Copilot AI review requested due to automatic review settings April 15, 2025 16:28

shsms requested a review from a team as a code owner April 15, 2025 16:28

shsms requested review from ela-kotulska-frequenz and removed request for a team April 15, 2025 16:28

github-project-automation bot added this to Python SDK Roadmap Apr 15, 2025

github-project-automation bot moved this to To do in Python SDK Roadmap Apr 15, 2025

github-actions bot added part:docs Affects the documentation part:data-pipeline Affects the data pipeline labels Apr 15, 2025

Copilot AI reviewed Apr 15, 2025

View reviewed changes

ela-kotulska-frequenz reviewed Apr 16, 2025

View reviewed changes

shsms added 2 commits April 16, 2025 10:14

Update release notes

6c31fbc

Signed-off-by: Sahas Subramanian <[email protected]>

shsms force-pushed the don't-stop-pools branch from 08ac282 to 6c31fbc Compare April 16, 2025 08:14

llucax approved these changes Apr 16, 2025

View reviewed changes

github-project-automation bot moved this from To do to Review approved in Python SDK Roadmap Apr 16, 2025

shsms added this pull request to the merge queue Apr 16, 2025

Merged via the queue into frequenz-floss:v1.x.x with commit df61636 Apr 16, 2025
5 checks passed

shsms deleted the don't-stop-pools branch April 16, 2025 08:39

github-project-automation bot moved this from Review approved to Done in Python SDK Roadmap Apr 16, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Don't stop pools #1199

Don't stop pools #1199

Uh oh!

shsms commented Apr 15, 2025 •

edited by llucax

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

ela-kotulska-frequenz left a comment •

edited

Loading

Uh oh!

ela-kotulska-frequenz Apr 16, 2025

Uh oh!

shsms Apr 16, 2025

Uh oh!

llucax commented Apr 16, 2025

Uh oh!

llucax left a comment

Uh oh!

shsms commented Apr 16, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Don't stop pools #1199

Don't stop pools #1199

Uh oh!

Conversation

shsms commented Apr 15, 2025 • edited by llucax Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Uh oh!

ela-kotulska-frequenz left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ela-kotulska-frequenz Apr 16, 2025

Choose a reason for hiding this comment

Uh oh!

shsms Apr 16, 2025

Choose a reason for hiding this comment

Uh oh!

llucax commented Apr 16, 2025

Uh oh!

llucax left a comment

Choose a reason for hiding this comment

Uh oh!

shsms commented Apr 16, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

shsms commented Apr 15, 2025 •

edited by llucax

Loading

ela-kotulska-frequenz left a comment •

edited

Loading